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AMENDMENTS TO THE SPECIFICATION : 

(1) Please amend the paragraph beginning on page 4, line 9, and ending on page 4, 
line 28, to read: 

An example of such technique is disclosed in "Modeling and Efficient Decoding of Large 
Vocabulary Conversational Speech", Michael Finke, Jiirgen Fritsch, Detlef Koll, Alex Waibel, 
Eurospeech 1999 Budapest. In such document the recognition process, in particular the acoustic 
model evaluation, is sped up by a dynamic frame skipping technique. The frame skipping 
technique based on the idea of re-evaluating acoustic models only provided the acoustic vector 
changed significantly from a time t to a time t+1 . A threshold on the Euclidean distance is 
defined to trigger re-evaluation of the acoustics. To avoid skipping too many consecutive frames 
only one skip is allowed at a time, i.e. after skipping one frame the next one must be evaluated. 
Such method, based on the cepstrum distance between input parameters, is not accurate, as the 
distribution of the acoustic parameters is a "multimode" distribution, even in the same acoustic 
class. As a consequence, frames having a high cepstrum distance can actually belong to the 
same acoustic class. Moreover such method does not allow to skip more [[then]] than one frame 
at a time. 

(2) Please amend the paragraph beginning on page 5, line 4, and ending on page 5, 
line 9, to read: 

The Applicant observes that the accuracy of a recognition process can be maintained at 
high levels, even if more [[then]] than one consecutive input fram e s are frame is skipped in those 
regions where the signal is supposed to be stationary, provided that the distance between 
non-consecutive frames is measured with sufficient precision. 
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(3) Please amend the paragraph beginning on page 10, line 27, and ending on page 10, 
line 29, to read: 

If the phonetic variability V is lower [[then]] than 15% the threshold is set to 4.0, while it 
is set to 1.0 when the phonetic variability V is comprised between 80% and 100%. 

(4) Please amend the paragraph beginning on page 12, line 15, and ending on page 12, 
line 23, to read: 

2.1 When the buffer is not full the system is in a synchronous phase of lookahead buffer 
filling and contemporaneous releasing of likelihoods already calculated in a previous calculation 
phase. At every step of this phase a single frame is acquired from the front-end and buffered, 
releasing a pre-calculated lik o likhood likelihood . When the buffer is again full the calculation of 
the likelihoods can start, according to point 2.2. 
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